Improved Automatic Maturity Assessment of Wikipedia Medical Articles - (Short Paper)

نویسندگان

  • Emanuel Marzini
  • Angelo Spognardi
  • Ilaria Matteucci
  • Paolo Mori
  • Marinella Petrocchi
  • Riccardo Conti
چکیده

The Internet is naturally a simple and immediate mean to retrieve information. However, not everything one can find is equally accurate and reliable. In this paper, we continue our line of research towards effective techniques for assessing the quality of online content. Focusing on the Wikipedia Medicinal Portal, in a previous work we implemented an automatic technique to assess the quality of each article and we compared our results to the classification of the articles given by the portal itself, obtaining quite different outcomes. Here, we present a lightweight instantiation of our methodology that reduces both redundant features and those not mentioned by the WikiProject guidelines. What we obtain is a fine-grained assessment and a better discrimination of the articles’ quality, w.r.t. previous work. Our proposal could help to automatically evaluate the maturity of Wikipedia medical articles in an efficient way.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Strategy Oriented , Machine Learning Approach to Automatic Quality Assessment

A Strategy Oriented, Machine Learning Approach to Automatic Quality Assessment of Wikipedia Articles Gabriel De La Calzada This work discusses an approach to modeling and measuring information quality of Wikipedia articles. The approach is based on the idea that the quality of Wikipedia articles with distinctly different profiles needs to be measured using different information quality models. ...

متن کامل

A Matter of Words: NLP for Quality Evaluation of Wikipedia Medical Articles

Automatic quality evaluation of Web information is a task with many fields of applications and of great relevance, especially in critical domains like the medical one. We move from the intuition that the quality of content of medical Web documents is affected by features related with the specific domain. First, the usage of a specific vocabulary (Domain Informativeness); then, the adoption of s...

متن کامل

Automatic Wikipedia Link Generation Based On Interlanguage Links

This paper presents a new way to increase interconnectivity in small Wikipedias (fewer than a 100, 000 articles), by automatically linking articles based on interlanguage links. Many small Wikipedias have many articles with very few links, this is mainly due to the short article length. This makes it difficult to navigate between the articles. In many cases the article does exist for a small Wi...

متن کامل

Relative Quality and Popularity Evaluation of Multilingual Wikipedia Articles

Despite the fact that Wikipedia is often criticized for its poor quality, it continues to be one of the most popular knowledge base in the world. Articles in this free encyclopedia on various topics can be created and edited in about 300 different language versions independently. Our research showed that in language sensitive topics quality of information can be relatively better in the relevan...

متن کامل

Relative Quality Assessment of Wikipedia Articles in Different Languages Using Synthetic Measure

Online encyclopedia Wikipedia is one of the most popular sources of knowledge. It is often criticized for poor information quality. Articles can be created and edited even by anonymous users independently in almost 300 languages. Therefore, a difference in the information quality in various language versions on the same topic is observed. The Wikipedia community has created a system for assessi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014